Two-stage sampling for etiologic studies. Sample size and power.

نویسندگان

  • D Schaubel
  • J Hanley
  • J P Collet
  • J F Bolvin
  • C Sharpe
  • H I Morrison
  • Y Mao
چکیده

Preexisting computerized databases are potentially valuable sources of epidemiologic data. Since such databases are infrequently created specifically for etiologic research, data may be available for the exposure of interest and, through record linkage, for the endpoint of interest, but lacking for potential confounders. Because of the size of these databases, two-stage sampling is an efficient alternative to surveying the entire study population for confounder data. At stage 1, information on exposure and disease status is obtained for the entire study population. Confounder data are collected for probability-selected subsamples at stage 2. Logistic regression is performed on the stage 2 samples, with the parameter estimates and variances appropriately corrected to account for the stage 1 data. In this paper, the authors present methods for determining the required stage 2 sample size in the case of categorical exposure and confounding variables. Sample size tables, power curves, and a computer program have been produced to accommodate a binary exposure and a single binary confounder. With the increasing availability of preexisting yet incomplete databases, the potential for use of two-stage sampling will greatly increase in the future. This investigation provides a basis for estimating the number of participants to sample for the collection of confounder data at the second stage.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian Sample size Determination for Longitudinal Studies with Continuous Response using Marginal Models

Introduction Longitudinal study designs are common in a lot of scientific researches, especially in medical, social and economic sciences. The reason is that longitudinal studies allow researchers to measure changes of each individual over time and often have higher statistical power than cross-sectional studies. Choosing an appropriate sample size is a crucial step in a successful study. A st...

متن کامل

Power in the phenotypic extremes: a simulation study of power in discovery and replication of rare variants.

Next-generation sequencing technologies are making it possible to study the role of rare variants in human disease. Many studies balance statistical power with cost-effectiveness by (a) sampling from phenotypic extremes and (b) utilizing a two-stage design. Two-stage designs include a broad-based discovery phase and selection of a subset of potential causal genes/variants to be further examined...

متن کامل

Determining the sample size required to compare vegetation and soil characteristics in two independent groups using effect size

Extended Abstract Background and objectives: One of the important steps in assessing rangeland vegetation is determining the sample size. Adequacy of sample size and its determination is always one of the main concerns of rangeland vegetation analyzer. There are two general methods for determining the sample size in rangeland science: graphic and statistical methods. In this study, the sample...

متن کامل

Bayesian Determination of Sample Size in Longitudinal Studies with Binary Responses Using Random Effects Models

Sample size determination is important in all statistical studies including longitudinal studies. This is usually done by considering a target power to reduce the costs of sampling. Choosing the right sample size using efficient methods, ensures that the researcher achieve goal of the study, by spending the least amount of energy, time and money. In this article, using a method based on simulat...

متن کامل

Design of Economic Optimal Double Sampling Design with Zero Acceptance Numbers

  In zero acceptance number sampling plans, the sample items of an incoming lot are inspected one by one. The proposed method in this research follows these rules: if the number of nonconforming items in the first sample is equal to zero, the lot is accepted but if the number of nonconforming items is equal to one, then second sample is taken and the policy of zero acceptance number would be ap...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • American journal of epidemiology

دوره 146 5  شماره 

صفحات  -

تاریخ انتشار 1997